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Abstract 

Background: Triticum monococcum L, an A genome diploid einkorn wheat, was the first domesticated crop. As a 
diploid, it is attractive genetic model for the study of gene structure and function of wheat-specific traits. Diploid 
wheat is currently not amenable to reverse genetics approaches such as insertion mutagenesis and 
post-transcriptional gene silencing strategies. However, TILLING offers a powerful functional genetics approach for 
wheat gene analysis. 

Results: We developed a TILLING population of 1,532 M 2 families using EMS as a mutagen. A total of 67 mutants 
were obtained for the four genes studied. Waxy gene mutation frequencies are known to be 1/17.6 - 34.4 kb DNA 
in polyploid wheat TILLING populations. The T. monococcum diploid wheat TILLING population had a mutation 
frequency of 1/90 kb for the same gene. Lignin biosynthesis pathway genes- COMT1, HCT2, and 4CL1 had mutation 
frequencies of 1/86 kb, 1/92 kb and 1/100 kb, respectively. The overall mutation frequency of the diploid wheat 
TILLING population was 1/92 kb. 

Conclusion: The mutation frequency of a diploid wheat TILLING population was found to be higher than that 
reported for other diploid grasses. The rate, however, is lower than tetraploid and hexaploid wheat TILLING 
populations because of the higher tolerance of polyploids to mutations. Unlike polyploid wheat, most mutants in 
diploid wheat have a phenotype amenable to forward and reverse genetic analysis and establish diploid wheat as 
an attractive model to study gene function in wheat. We estimate that a TILLING population of 5, 520 will be 
needed to get a non-sense mutation for every wheat gene of interest with 95% probability. 
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Background 

Common or bread wheat Triticum aestivum L. is an 
allohexaploid having three genomes A, B and D and a 
huge genome size of -17 Gb [1,2]. For gene functional 
analysis, all the three homoeologous loci have to be indi- 
vidually targeted and subsequently combined to evaluate 
the phenotypes. This may take years. Although this is 
necessary for many genes of agronomic interest, it is 
cumbersome for other genes where the biological func- 
tion has not been validated in wheat. Triticum monococ- 
cum is a cultivated A-genome diploid wheat and can be 
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used to study traits, genes, and alleles as a model for 
bread wheat. Establishing phenotype to genotype rela- 
tionships and allocating function of variant alleles in 
diploid wheat is considerably straightforward. As an al- 
ternative to studying gene functions in hexaploid wheat 
it is feasible to mine databases of model species such as 
rice and Br achy podium and apply it to T monococcum. 
The knowledge of gene function in diploid wheat will be 
applicable to hexaploid wheat and, furthermore, any use- 
ful novel trait found in diploid wheat can be transferred 
to hexaploid wheat via established breeding procedures. 

Targeting Induced Local Lesions in Genomes (TIL- 
LING) is a powerful reverse genetics approach first 
developed in the model plant Arabidopsis and fruitfly 
Drosophila melanogaster [3,4]. Subsequently it has been 
successfully applied to many plants including maize [5], 
barley [6,7], rice [8-10], sorghum [11], hexaploid and 
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tetraploid wheat [10-15], soybean [16], oat [17], Brassica 
[18] and, tomato [19]. TILLING has become the method 
of choice for the functional analysis of genes, because it 
can be applied to a broad range of organisms and prac- 
tically any gene of interest [20]. Furthermore, mutagen- 
esis is achieved in TILLING without the addition of 
transgenic DNA and is stable as compared to other re- 
verse genetics tools like insertional mutagenesis, RNA 
interference (RNAi) [21,22], and genome editing using 
ZFNs and TALENs [23,24]. TILLING also produces a 
broad range of alleles including nonsense, missense, spli- 
cing and, cis-regulatory, which may be used to assign 
functional domains of proteins [9]. 

TILLING in hexaploid wheat has been reported by sev- 
eral groups [10,12-15,25,26]. However, TILLING in diploid 
wheat, although desirable, has not been attempted. To ex- 
ploit the distinct advantages of TILLING in diploid wheat 
for functional genomics studies, we developed and charac- 
terized an EMS-induced TILLING population consisting 
of 1,532 fertile/partially fertile M 2 families. The utility and 
mutation frequency of the population was determined 
using genes waxy, caffeic acid O-methyltransferase 1 
(COMT1), 4-coumarate-CoA ligase 1 (4CL1), and hydroxy- 
cinnamoyl-CoA:shikimate/quinate hydroxycinnamoyltrans- 
f erase (HCT2). The waxy gene was used because mutation 
frequencies are already established for it in hexaploid 
wheat [12]. COMT1, 4CL1, and HCT2 are three important 
genes in the lignin biosynthetic pathway of plants [27,28], 
and the frequency of mutation for COMT1 is known in a 
sorghum TILLING population [11] for comparative muta- 
tion frequency analysis. 

Results 

Development of the T. monococcum TILLING population 

Our pilot experiment showed that a 0.24 % EMS muta- 
genesis treatment of T. monococcum accession TA4342- 
96 had a kill rate of 50% and produced many phenotypic 
mutants [Figure 1]. Ninety percent of the surviving M 2 
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Figure 1 EMS Dosage optimization curve. A dose of 0.24 % EMS 
was applied to develop T. monococcum (TA4342-96) TILLING 
population at a kill rate of 51%. 



individuals were fully/partially fertile. Increasing the 
dose would compromise the survival and fertility of 
plants, and a lower dose would not yield enough muta- 
tions to saturate the genome. Out of a total of 3,700 
T. monococcum seeds treated (M 0 ), 1,890 Mi plants 
survived and set seed. One seed per spike was sown 
from each Mi plant to reduce mutation redundancy 
and 1,700 viable M 2 plants were obtained. One hun- 
dred and sixty-eight M 2 plants were sterile and did 
not produce any seeds. Plant tissue was collected and 
catalogued from the fully/partially fertile 1,532 M 2 
plants and selfed M 3 seed from these was archived. 

Several phenotypic mutants, such as albino, chlorina, 
striped, dwarf, grassy-shoot, broad leaf, early/late flower- 
ing, and delayed senescence were observed in the M 2 
generation. Figure 2 gives the number of phenotypic 
mutants of each type observed in the population. 

Sequencing of the T. monococcum lignin genes 

Primers were designed to cover only the exonic regions 
of the three lignin genes studied, so as to avoid 
translationally-inactive intronic regions, using publically 
available hexaploid wheat mRNA sequences, rice 
genomic sequences, and wheat-rice homology. The 
exons of T. monococcum COMT1, HCT2, and 4CL1 
were PCR amplified and sequenced using the designed 
primers. The sequences were submitted to NCBI. 
The Genbank accession numbers for T. monococcum 
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Figure 2 Number and types of phenotypic mutants observed in 
the TILLING population. 
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COMT1, HCT2, and 4CL1 genes (exonic regions) are 
JX473722, JX473723, and JX473724, respectively. 

Mutant identification by TILLING 

Four genes: waxy, COMT1, 4CL1, and HCT2 were used 
to characterize the mutation frequency in the TILLING 
population (Table 1). For screening the waxy locus, two 
primer pairs, Wl [29] and DOS (designed from sequence 
of T. monococcum wx-Tma gene, Genbank Accession 
No. AF1 10373), were used. We found a total of 27 
mutants for the waxy locus, 13 for Wl, and 14 for DOS, 
by scanning a total of 2,438 kb of DNA. Two primer pairs 
each were designed for COMT1, 4CL1, and HCT2 cover- 
ing their exonic regions. For 4CL1, however, last three 
exons were combined because of their very small size to 
generate a single amplicon (primer pair 4CL1_C). 
COMTl_A and COMTl_B were designed on the basis of 
the hexaploid wheat COMT1 gene sequence. For 
COMTl_A and COMTl_B, 6 mutants each were found 
in a total of 949 kb scanned. For HCT2_A and HCT2_B, 
4 and 12 mutants, respectively, were detected by scan- 
ning a total of 1,452 kb. Five and seven mutants were 
found for 4CL1_B and 4CL1_C, respectively, in a total of 
1,248 kb scanned. The mutation frequency for the waxy 
gene was found to be 1/90 kb, whereas for COMT1, 
HCT2, and 4CL1 the frequencies were 1/86 kb, 1/92 kb, 
and 1/100 kb, respectively. The overall mutation fre- 
quency of our TILLING population on the basis of the 
all four genes was 1/92 kb. 

Of the total 67 mutants identified, 51 were heterozy- 
gous and 16 were homozygous, which fits the expected 
ratio of 2:1 heterozygous:homozygous M 2 plants 



(X 2 = 1.796, P<0.01). Table 2 gives the details of the 
mutants found for all four genes. 

Sequencing of the mutants 

All 67 mutations identified were G/C > A/T transitions. 
Of the 27 mutations identified for the waxy locus, eight 
were in the non-coding regions. Nine of the 19 muta- 
tions in the coding region were silent, because the SNP 
occurred in the third wobbling base of the triplet codon. 
The remaining 10 mutants of the waxy locus had mis- 
sense mutations. For 4CL1, one non-sense mutation was 
found in 6-9-G10, where a G>A mutation converted a 
tryptophan to a stop codon. Of the remaining 12 muta- 
tions, four were in the introns, three were silent, and 
four were mis-sense. Because the exon size of the HCT2 
and COMT1 genes encompassed the amplicon, all iden- 
tified mutations were within the coding regions of these 
genes. In these two genes, 39% of the mutations were 
found to be silent; the remaining 61% were mis-sense. 

To assess the probability of phenotypic effect of the 
mis-sense mutations, bioinformatic tools SIFT (Sorting 
Intolerant From Tolerant) [30] and PSSM (Position spe- 
cific scoring matrix) [31] were employed. For a mutation 
to be intolerant, the SIFT scores should be less than 
0.05, whereas high positive PSSM scores (>6) suggest 
deleterious effects of the mutation on protein function. 
The SIFT scores of all the identified mis-sense mutations 
in COMT1, HCT2, and 4CL1 were more than 0.1 and 
their PSSM scores were low, indicating the mis-sense 
mutations were notdeleterious. All SIFT scores were in 
agreement with the PSSM values. 



Table 1 Details of the primers and mutation frequencies found for genes in the T. monococcum TILLING population 



Primer Name 


Gene 


Sequence 5/-3/ 


Number of 
mutants identified 


Product size 


No. of bases 
screened 


Gene-wise mutation 
frequency 


W1_F 


waxy 


TOCTGCAGGTAGCCACAC 


13 


931 


1141034 


1/90 kb 


W1_R 




CTCAAGTGCTGCGTGGGAGAGAA 










DOS_F 




AGGACAACCAGCTACGCTOAG 


14 


1058 


1296685 




DOS_R 




TGATGACGTACCAGGAGAACGA 










COMT1_AF 


COMT1 


CATGTACGCTCTCCAGCTCGTCTC 


6 


379 


464502 


1/86 kb 


COMT1_AR 




AGCTCTCCATGAGGACCHTGTC 










COMT1_BF 




GAAGCACTCCATCATCATCACC 


6 


466 


571130 




COMT1_BR 




CGAACTCCCTCTCGTACCTCTC 










HCT2_AF 


HCT2 


CGCCGAGCGTCTACTOTO 


4 


308 


377485 


1/92 kb 


HCT2_AR 




GAGGAGCGGGTAGGAGGAGAT 










HCT2_B 




GGCCTGCACTOATCAACTC 


12 


877 


1074851 




HCT2_BR 




CGGTGC^CmCmCTGC 










4CL1_BF 


4CL1 


AGGACGCCTOATGGCTAAGATCC 


5 


317 


388515 


1/100 kb 


4CL1_BR 




TCATGATCTGCTCGCCGCGGATGC 










4CL1_CF 




AGAGTCCACCAAGAACACCATC 


7 


701 


859146 




4CL1_CR 




CTGGCTCTCAAGTCCTOCTC 
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Table 2 Nature of mutations found in the T. monococum TILLING population 



Primer set 


Plant ID 


Base pair 


Homo/heterozygous 


Type of 


Amino acid 


Position of 


SIFT 


PSSM 


(F+R) 


(Pool-Plate-rowcolumn) 


change 




mutation 


change 


amino acid* 


score 


score 


C0MT1_A 


1 -4-G6 


C>T 


Hetero 


Mis-sense 


Thr>Met 


90 


0.14 


0 




2-1 -E6 


G>A 


Hetero 


Silent 




91 








4-1 -A3 


G>A 


Homo 


Mis-sense 


Ala>Lys 


95 


0.36 


+ 1 




4-1 -A1 1 


G>A 


Hetero 


Mis-sense 


Glu>Lys 


92 


0.24 


+ 1 




5-7-D6 


G>A 


Hetero 


Mis-sense 


Ala>Thr 


65 


0.25 


+4 




6-10-G8 


G>A 


Homo 


Mis-sense 


Glu>Lys 


116 


0.23 


+ 1 


C0MT1_B 


1-2-B8 


C>T 


Hetero 


Mis-sense 


Ala>Val 


275 


0.45 


+4 




3-3-E4 


G>A 


Hetero 


Silent 




298 








4-3-B10 


G>A 


Hetero 


Mis-sense 


Met>lle 


261 


0.55 


0 




4-4- H 5 


G>A 


Homo 


Mis-sense 


Glu>Lys 


271 


0.41 


-1 




5-8-A1 


G>A 


Hetero 


Mis-sense 


Glu>Met 


298 


0.16 


-4 




6-10-F3 


C>T 


Hetero 


Mis-sense 


Pro>Ser 


248 


0.42 


+2 


HCT2_A 


1-3-B12 


G>A 


Hetero 


Mis-sense 


Gly>Arg 


131 


0.25 


-2 




2-5-G6 


G>A 


Hetero 


Silent 




100 








5-8-C3 


C>T 


Homo 


Silent 




72 








5-6-C1 1 


G>A 


Hetero 


Silent 




85 






HCT2_B 


1-1 -A4 


G>A 


Hetero 


Mis-sense 


Glu>Lys 


208 


0.45 


+ 1 




1-2-E7 


G>A 


Homo 


Silent 




371 








2-1 -El 2 


C>T 


Hetero 


Silent 




395 








2-6-H8 


G>A 


Hetero 


Mis-sense 


Asp>Asn 


330 


0.56 


-2 




3-2-G8 


C>T 


Homo 


Silent 




359 








3-2-F5 


C>T 


Hetero 


Mis-sense 


Thr>Met 


314 


0.19 


-1 




4-2-A9 


G>A 


Homo 


Mis-sense 


Ala>Thr 


275 


0.50 


-3 




4-1-B1 


C>T 


Hetero 


Silent 




308 








4-3-G4 


G>A 


Hetero 


Mis-sense 


Va I > 1 1 e 


310 


0.59 


+2 




5-8-C8 


C>T 


Hetero 


Mis-sense 


Leu>Phe 


267 


0.37 


+2 




6-10-A8 


G>A 


Hetero 


Silent 




287 








6-10-F1 1 


G>A 


Hetero 


Silent 




414 






4CL1_B 


2-1-H5 


G>A 


Hetero 


Mis-sense 


Met>lle 


405 


0.31 


-3 




3-4-C10 


C>T 


Hetero 


Mis-sense 


Thr>Met 


345 


0.1 1 


-3 




4-1 -E2 


G>A 


Hetero 


Silent 




340 








5-8-A10 


G>A 


Homo 


Mis-sense 


Gly>Asp 


370 


0.28 


-7 




6-9-G10 


G>A 


Hetero 


Non-sense 


Trp>Stop 


338 






4CL1_C 


1-2-D3 


C>T 


Hetero 


Intronic 












1-1 -A7 


G>A 


Hetero 


Intronic 












3-4-B7 


C>T 


Hetero 


Intronic 












4-3-B6 


C>T 


Hetero 


Intronic 












5-7-E8 


G>A 


Homo 


Mis-sense 


Val>lle 


476 


0.27 


+ 1 




5-6-B8 


C>T 


Hetero 


Silent 


Gly 


485 








6-9-H8 


C>T 


Hetero 


Silent 


Gly 


534 






W1 


1-2-C5 


G>A 


Hetero 


Mis-sense 


Gly>Ser 


95 


0.20 


-4 




1-1-D7 


C>T 


Hetero 


Mis-sense 


Ala>Val 


58 


0.19 


Not found** 




2-1-H5 


C>T 


Hetero 


Silent 




104 








2-5-E5 


C>T 


Hetero 


Intronic 












3-3-B3 


G>A 


Homo 


Mis-sense 


Gly>Glu 


101 


0.17 


-1 




3-3-E3 


C>T 


Hetero 


Intronic 
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Table 2 Nature of mutations found in the T. monococum TILLING population (Continued) 



3-2-G3 


C>T 


Hetero 


Silent 




60 






4-3-B5 


G>A 


Homo 


Silent 




51 






4-4- F9 


G>A 


Hetero 


Intronic 










4-4-G9 


G>A 


Hetero 


Mis-sense 


Arg>His 


153 


0.09 


+1 


5-8-E6 


C>T 


Homo 


Intronic 










6-10-F1 


C>T 


Hetero 


Intronic 










6-9-H7 


G>A 


Hetero 


Mis-sense 


Glu>Lys 


143 


0.14 


-1 


DOS 1 -4-D8 


G>A 


Hetero 


Silent 




253 






1 -2-G4 


G>A 


Hetero 


Mis-sense 


Glu>Lys 


328 


0.12 


-2 


2-5-D6 


G>A 


Hetero 


Intronic 










2-6-G4 


G>A 


Homo 


Intronic 










3-2-D1 


G>A 


Homo 


Silent 




359 






3-3-G9 


G>A 


Hetero 


Mis-sense 


Val>Met 


254 


0.10 


-6 


j-4-nz 


L> I 


Hetero 


Mis-sense 


Aia> vai 


one; 


U. I o 


—5 


4-3-F4 


G>A 


Homo 


Intronic 










4-1 -H2 


G>A 


Hetero 


Silent 




294 






5-7-A8 


C>T 


Hetero 


Mis-sense 


Ala>Val 


399 


0.11 


+2 


5-7-D1 


G>A 


Hetero 


Silent 




315 






5-5-H1 


C>T 


Hetero 


Mis-sense 


Ala>Val 


252 


0.10 


+2 


6-10-E6 


G>A 


Hetero 


Silent 




277 






6-9-F6 


C>T 


Homo 


Silent 




346 







*Position of amino acid acid change in rice protein, where wheat reference not available. 
**PSSM not found, compositionally biased region, not used in domain database search. 



Lignin quantification 

Wild type T. monococcum was found to contain 21.1% 
lignin content in their spikelets. Homozygous M 3 indivi- 
duals of four mutants from each COMT1, 4CL1, and 
HCT2 primer pair were analyzed in six replications for 
lignin content. The mutant with non-sense mutation in 
4CL1_B had 19.6% lignin content in the spikelets, which 
is 7.7% lower than that in the wild type. However, 
this difference was found to be statistically insignificant 
(X 2 =0.124, P<0.05). Other mutants had lignin content in 
the range of 17.3-22.7%, which also were not signifi- 
cantly lower than the wild type. This was as expected 
based on their SIFT and PSSM predictions. 

Discussion 

TILLING, based on chemically induced mutations, has 
become a method of choice for reverse genetics studies, 
because it can be applied to a broad range of organisms 
to scan for mutations in any gene of interest. Greene 
et al. [20] showed that EMS -induced mutations are ran- 
domly distributed across the entire genome and, theoret- 
ically, all the genes are responsive to mutagenesis and 
TILLING, contrary to post-transcriptional gene silencing 
(PTGS) techniques, which have variable success rates 
with some organisms/genes. Once a good TILLING re- 
source has been developed for an organism, it can be 
used to study any gene function in that organism, 



whereas with other reverse genetics study approaches, 
resources have to be developed individually for each 
gene. Additionally, allelic series of genes obtained by 
TILLING can provide information on important 
domains or amino acids within the protein of interest 
[9,32]. In Arabidopsis, hexaploid wheat, and maize, more 
than 99% of the EMS-induced mutations are G/C > A/T 
transitions [5,12,20]. All the mutations found in diploid 
wheat were also G/C > A/T transitions. In barley, rice, 
soybean and Drosophila, however, other transitions and 
transversions, have been reported to occur in 10-30% of 
the cases [6,9,16,33]. 

Wheat has different levels of polyploidy ranging from 
diploid to hexaploid. Polyploids, such as hexaploid bread 
wheat, tetraploid durum wheat, and hexaploid cultivated 
oat, are easily amenable to TILLING probably because 
genome buffering enables plants to survive a high level 
of mutation [12-14,17]. However diploid plants like Ara- 
bidopsis, rice, diploid wheat, and barley have non- 
redundant complements of genes and are expected to be 
less tolerant of increasing mutation density. Therefore, 
creating mutant populations in a diploid species is a very 
careful exercise, because the mutagenesis treatment has 
to be such that it induces mutation but at the same time 
does not lead to all defective or sterile plants [32]. Hexa- 
ploid and tetraploid wheat can tolerate an EMS treat- 
ment as high as 1% and 0.75%, respectively, whereas 
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with our diploid wheat only 3% of the Mi plants sur- 
vived a 0.5% EMS treatment. Similar low tolerance rates 
with EMS mutagenesis have been reported in other dip- 
loid plants such as Arabidopsis [21,34], rice [8,9], barley 
[6,35,36], and sorghum [11]. The T. monococcum TIL- 
LING population developed in this study can be used as 
a model for functional genomics, applicable to tetraploid 
and hexaploid wheat. Presence of only one homoeolo- 
gous locus for any gene in diploid wheat, as against two 
or three homologous loci in tetraploid wheat and hexa- 
ploid wheat, respectively makes this population useful 
for validating gene function and structure. Information 
from model plants like rice, Brachypodium, or Arabidop- 
sis can be applied directly to diploid wheat. In this study 
information from rice genomic database was used to 
predict gene structure of T. monococcum lignin genes 
and primers were designed for covering only exonic 
regions for mutation detection to find maximum func- 
tional mutations. 

Ce/-I-based single-nucleotide specific mismatch cleav- 
age was used to detect mutations in the genes of interest 
as it is a robust technique of identifying SNPs with 
nearly zero false detection rate [11,13,20]. Agarose gel 
platforms were used due to their quick availability and 
cost effectiveness [25,36,37]. However, this method is 
not very high throughput due to the increased signal-to- 
noise ratio with higher pooling in agarose gels. With the 
reduction in costs of next generation sequencing (NGS), 
it is now possible to TILL several genes simultaneously 
using TILLING-by-sequencing [10,38,39]. Screening for 
mutations using NGS can be done along two lines; first 
by sequencing pooled amplicons [10,38] and second by 
sequence capture [39,40]. The former has high through- 
put for the number of individuals for some genes, 
whereas the latter is suitable for scanning many genes 
within a few individuals [10]. Both approaches involve a 
considerable amount of work preparing sequencing li- 
braries. Analysis of NGS reads also requires extensive 
software treatment and, because the scope of our inves- 
tigation was limited to a few genes, it was more practical 
to use the traditional Cel-l assay. 

The waxy gene, encoding granule bound starch syn- 
thase I (GBSSI), has been widely studied in wheat. 
Knockouts of waxy have starch composed entirely of 
amylopectin and no amylose, and have been developed 
by combining null alleles at all the three loci (ABD) by 
breeding [41]. TILLING has been used to identify 
mutants at all the functional waxy homoeologous loci in 
hexaploid wheat [12,25]. Slade et al. [12] reported a very 
high mutation frequency of 1/18 kb DNA screened in 
hexaploid bread wheat, followed by that of tetraploid 
durum wheat (1/34 kb DNA). The low mutation fre- 
quency for the same waxy locus in our diploid wheat 
TILLING population (1/90 kb) indicates that a decrease 



in ploidy level reduces tolerance to mutations due to 
lack of genome buffering. 

The 4CL genes are a multigene family and have been 
investigated extensively in dicots and gymnosperms for 
their role in lignin biosynthesis [42-45]. In dicots such as 
Arabidopsis and Populus trichocarpa, 4CL genes have 
been classified into two types, type I (4CL7, 4CL2, and 
4CL4), regulate lignin accumulation, and type II (4CL3) 
is mainly responsible for the synthesis of other phenolic 
compounds [42,46]. However, monocots have different 
monolignol composition and phenolic compounds and 
alternative genes have been proposed to play varying 
roles in lignin synthesis [47,48]. 

Gui et al. [48] studied the catalytic properties and roles 
of all the five rice 4CL genes in lignin biosynthesis by 
suppressing their expression using antisense RNA. Sup- 
pression of only 4CL3 resulted in a significant decrease 
in lignin content, poor plant growth, and abnormal an- 
ther development. Suppression of 4CL1 did not affect 
lignin content in rice. Thus, it was concluded that 
monocots have different genes than dicots playing dis- 
tinct roles in the lignin biosynthesis pathway [48]. Our 
findings with a truncation mutant of the wheat 4CL1 
gene showing insignificant reduction in lignin content 
may also be due to the same reason. Our previous RNAi 
studies to suppress 4CL1 genes in hexaploid wheat and 
did not detect any change in lignin content (Bi et al. un- 
published results). More experimental proof is needed 
however, before ruling out 4CL1 as the major determin- 
ant in monocots in general. Based on sequence similar- 
ity, another class, type III 4CL, was added in monocots 
by Gui et al. [48] (including rice 4CL3, 4CL4, 4CL1 and 
4CL5) to the known dicot classes. 

For the COMT1 gene, Xin et al. [11] screened 768 
EMS mutagenized lines of sorghum by TILLING and 
found two mutants by screening 624.4 kb. The frequency 
of mutation in the gene was 1/312 kb in their popula- 
tion. Our TILLING population had quite a high 
mutation frequency for the COMT1 gene (1/92 kb). 
However, brown midrib was not observed in any of the 
mutants because none of the identified mutants for 
COMT1 had a truncation or mis-sense mutation with a 
deleterious effect. 

Diploid species such as barley, sorghum, pea, peanut, 
and rice have low mutation frequencies ranging from 
1/300 to 1,000 kb compared to the polyploids such as 
oat, bread wheat, and soybean [5,6,10,11,15,16]. Our 
TILLING population has higher frequency of mutation 
(1/104 kb) than the other diploid grasses (Table 1). 
Martin et al. [34] developed a TILLING population of 
Arabidopsis in the Landsberg erecta genetic background 
and were able to achieve a high mutation frequency 
(1/89 kb) by selecting for plants with low fertility in the 
Mi generation. The frequencies of mutations vary across 
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genes, species, mutagen, treatment procedure, and de- 
tection strategies [13]. In fact, in wheat, different fre- 
quencies of mutations have been reported for different 
genes, sometimes even in the same genetic background 
[13,14]. Frequencies of mutation for the same gene and 
genotype are variable even across the three wheat gen- 
omes [12,13]. Variable frequencies of mutation have 
been reported in wheat for Starch branching enzyme Ha 
{SBEIIa, 1/49 to 1/124 kb), Wheat Kinase Startl (WKS1, 
1/37 to 1/60 kb), WKS2 (1/36 to 1/42 kb), waxy (1/12 
kb to 1/41 kb), Puroindoline a (Pina, 1/31 kb), and Pinb 
(1/29 kb) [12,25,26]. The mutation frequency (1/92 kb) 
of our TILLING population establishes it as a suitable 
high-throughput, reverse genetics resource of wheat. 

About 5% of the mutants found in TILLING popula- 
tions of Arabidopsis, rice, and wheat have been 
reported to be truncation mutants [12,32,34,49]. Be- 
cause diploids have a lower frequency of mutation, lar- 
ger populations are needed to get at least one 
truncation event per gene compared to polyploids. 
Table 3 summarizes the frequencies of mutations 
reported in various species and gives an estimate of the 
approximate number of lines required to get at least 
one truncation mutant with a 95% probability for each 
gene in a typical lkb coding region screened, assuming 
5% of the total mutations are truncations. With the 
current mutation frequency in our T. monococcum 
population, about 5,520 mutagenized individuals will be 
needed to get one stop codon for any gene of interest 
with 95% probability. 



Conclusions 

The TILLING population developed in diploid wheat 
will be a useful genetic resource as a model system for 
studying wheat gene function as a complement to simi- 
lar analysis in polyploid wheat. Because a population of 
at least 5,520 M 2 individuals will be needed to obtain at 
least one knock-out per gene, a new set of mutagenized 
population in the same genetic background is currently 
being developed to increase the population size. Seed of 
this population are being increased and will be made 
available after the M 4 generation for interested workers 
upon request. Next generation sequencing will be uti- 
lized to characterize mutation in genes of interest and 
increase the high throughput of this resource for wheat 
functional genomics analysis. 

Methods 

Plant material and EMS mutagenesis 

Diploid wheat, Triticum monococcum subsp. monococcum 
(accession number TA4342-96) was used to develop the 
TILLING population. TA4342-96 is a spring-type geno- 
type and has a planting to heading date of -90 days. All 
the plants were grown in greenhouse at 20-25°C with a 
light period of 16 h. 

To determine the appropriate concentration of the 
mutagen EMS, two rounds of tests were made. First, the 
dose of the EMS needed to achieve 40-60 % survival 
among the Mi plants was determined. Five sets of 50 
seeds of TA4342-96 were soaked in water in 250ml glass 
flasks for 8 hours of imbibition on a shaker at 75 rpm 



Table 3 Estimated population size required to identify a truncation mutant in some TILLING populations* (Modified 
from Parry et al. 2009 [32]) 



Species 


Ploidy 


Frequency of 
mutation 


Population size 
used 


Population size 
required 


Reference 


Arabidopsis 


diploid 


1/300 kb 


3 072 


18 000 


Greene et al. 2003 [20] 




diploid 


1/89 kb 


3 712 


5 340 


Martin et al. 2009 [34] 


Sorghum 


diploid 


1/526 kb 


768 


31 560 


Xinetal. 2008 [11] 


Rice 


diploid 


1/294 kb 


768 


17 640 


Till etal. 2006 [51] 




diploid 


1/135 kb 


767 


8 100 


Suzuki et al. 2008 [49] 


Barley 


diploid 


1/1000 kb 


9 216 


60 000 


Caldwell et al., 2004 [6] 




diploid 


1/374 kb 


4 906 


22 440 


Talame et al. 2008 [7] 


Einkorn wheat 


diploid 


1/92 kb 


1 532 


5 520 


This study 


Durum wheat 


tetraploid 


1/40 kb 


768 


2 400 


Slade etal., 2005 [12] 




tetraploid 


1/51 kb a 


1 386 


3 060 


Uauy etal., 2009 [13] 


Arabidopsis 


tetraploid 


<1/100 kb 


528 


6 000 


UC Davis Genome Center 


Bread wheat 


hexaploid 


1/24 kb 


1 152 


1 440 


Slade etal., 2005 [12] 




hexaploid 


1/23-38 kb b 


2 348 


1 380 


Dong et al., 2009 [25] 




hexaploid 


1/38 kb a 


1 536 


2 280 


Uauy etal., 2009 [13] 


Oat 


hexaploid 


1/20-40 kb 


2 550 


1 200 


Chawadeetal. 2010 [17] 



*at 95 % probability in some TILLING populations, assuming 5% of the mutations being non-sense, 
formalized to 50% G/C content, 
depending upon cultivar and EMS treatment. 
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and then treated with five different doses (0.1, 0.2, 0.3, 
0.4, and 0.5 %) of EMS for 16 hours on shaker at 75 
rpm. The treated seeds were washed under running 
water for 8 hours and then transplanted individually into 
root trainers. Observations were made 15 days after 
transplanting to estimate the survival frequency. EMS 
doses 0.2 and 0.25% were nearest to the targeted percent 
survival of plants. A second round of experiment was 
performed with six doses of EMS (0.20, 0.21, 0.22, 0.23, 
0.24, and 0.25%) using 100 seeds per treatment. A kill 
rate of 50% was assumed to be desirable, because it gen- 
erated several phenotypic mutants and at the same time 
yielded majority of the surviving plants fully/partially 
fertile. The 0.24% EMS treatment gave the desirable kill 
rate near 50% and was used as the dose of choice for 
treating a total of 3,700 T. monococcum seeds (Figure 1). 
The Mi population (derived from the M 0 EMS -treated 
seed) was selfed. A single M 2 was grown from every 
Mi plant to prevent genetic redundancy. From a total 
1,700 M 2 individuals 168 were sterile. Tissue was col- 
lected, and the seed cataloged at maturity for all 1,532 
fertile/partially fertile M 2 individuals of the TILLING 
population. 

Development of DNA pools 

Leaf tissue from all 1,532 M 2 individuals was collected at 
the four-leaf stage in 96-well blocks. DNA was isolated 
using a Qiagen Biosprint 96 robot with Biosprint 96 
DNA plant kit (Qiagen, Valencia, CA) according to the 
manufacturers instructions. DNA was quantified on a 
Nano-drop and normalized to 25ng/|il in 96-well blocks. 
Subsequently, 4x pooling was done using 200|il of nor- 
malized DNA from each pool member. The mutants 
were catalogued and their DNA was identified with a 
unique ID as Pool- Plate-Row-Column. 

Design of primers for genes of interest 

Four genes, waxy, COMT1, 4CL1, and HCT2, were used 
to characterize the mutation frequency of the TILLING 
population. The waxy locus, with a known frequency of 
mutation in hexaploid and tetraploid wheat [12,25], was 
scanned for making a direct comparison of mutation 
frequency with ploidy level. However, the A-genome 
specific primers of Slade et al. [12] (WXA2 and WXA3) 
did not amplify in our T. monococcum accession. The 
Wl primer pair from Yan et al. [29] was used, and an 
additional primer pair, named DOS, was designed from 
T. monococcum wx-Tma gene (Genbank Accession No. 
AF1 10373). Both these primer pairs covered the exonic 
as well as the intronic regions of the gene. The primer 
pairs Wl and DOS produced amplicons of 931 and 
1,058 bp, respectively, and were designed such that they 
had an overlap of 47 bp. 



Three important genes of the lignin biosynthetic 
pathway, COMT1, HCT2, and 4CL1, were also used to 
validate the frequency of mutation found at the waxy 
locus. Complete cds of the hexaploid wheat COMT1 
gene (Genebank Accession no. AY226581) was used to 
design primer pairs. Two primer pairs, COMTl_A and 
COMTl_B, were designed to cover the exonic regions 
on the basis of the hexaploid wheat COMT1 gene se- 
quence (Genebank Accession no. AY226581). For 
genes 4CL1 and HCT2, the TaGI database (version 
1 1.0, http://compbio.dfci.harvard.edu/cgi-bin/tgi/gimain. 
pl?gudb= wheat) was searched by BLAST against rice 
loci LOC_Os06g44620 and LOC_Os02g39850, respect- 
ively. TC284202 and TC304595 were used to design 
two primer pairs each for 4CL1 and HCT2, respect- 
ively. Because full-length gene sequences were unavail- 
able for the target genes, homology between rice and 
wheat genes was used to identify exons from the 
available wheat cDNA sequences using the web tool 
SPIDEY [50]. With this approach, the exonic regions 
of genes were determined and primers were designed 
to cover them. For 4CL1_C however, the last three exons 
were combined to generate a single amplicon because of 
their very small size. Two primer pairs each, for all the 
three genes, covering all the exons and yielding single 
bright amplicons, were employed for scanning the TIL- 
LING population. The sequences of exons and encom- 
passed intronic regions of genes COMT1, HCT2, and 
4CL1 were submitted to NCBI and have been assigned 
Genbank accession numbers JX473722, JX473723, and 
JX473724, respectively. 

PCR, Ce/-I digestion, and mutant visualization 

The target regions were amplified from pooled DNA 
using Biolase PCR kits (Bioline, Tauton, MA, USA) in 
25 \A reaction volume, on a BioRad thermocycler 
(BioRad, Hercules, CA, USA). All 1,532 pooled M 2 indi- 
viduals were screened for mutations in all the four genes 
studied. A touchdown profile (95°C-5 / , 7 cycles of 95°C- 
l', 67-60°C-l / with a decrease of 1°C per cycle, 72°C-1 / , 
followed by 30 cycles of 95°C-30s, 60°C-30s, 72°C-45s, 
and a final extension of 72°C-7 / ) was used. PCR products 
were subsequently denatured and slowly reannealed to 
form heteroduplexes between mismatched DNA (95°C- 
2', 5 cycles of 95°C-01s, 95-85°C-l / with a decrease of 
2°C per cycle, and 60 cycles of 85-25°C- 10s. Home-made 
Cel-l endonuclease was extracted from celery according 
to Till et al. [51] and optimized using a SURVEYOR 
Mutation Detection Kit (Cat. No. 706020, Transgenomic 
Inc., Omaha, NE, USA). For optimization, 1 \A each of 
Cel-l and enhancer from the kit, was added to 25 \A of 
heteroduplexed PCR product with the DNA and primers 
supplied with the kit. The intensity of cleaved bands was 
used to standardize optimum volumes of home-made 
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Figure 3 Mutant identification in 4x pools and subsequent deconvolution. (A) Identification of a mutant pool in lane 1. (B) Deconvolution 
to identify a mutant individual. In the de-pooling for each plate the first lane has M 2 plant DNA and second lane has M 2 +wild type DNA. Plant in 
Box-1 is heterozygous for mutation. 
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Cel-l required to digest mismatches in the products. Two 
ul of Cel-I was added to the heteroduplexed products 
and incubated at 45°C for 45'. Reactions were stopped 
using 2.5 ul 0.5M EDTA. 

The digested products were visualized on 2% agarose 
gels. Mutants could be identified as those products that 
showed cleaved bands in addition to the full-length, 
uncleaved product (Figure 3). The total number of bases 
scanned was calculated by subtracting 20% of the prod- 
uct size, to take into account the primer base pairs and 
terminal regions that escape detection. Such factors have 
been taken into consideration while calculating total 
coverage in LICOR, PAGE, and agarose gels by other 
workers [12,23,25]. 

Deconvolution and sequencing of mutants 

Pools showing mutation were deconvoluted with a simi- 
lar procedure as above to identify the particular plants 
with the mutation. Each individual member from the 
mutant pool was subjected to two reactions, one of 
which had only the M 2 DNA, whereas the second reac- 
tion had wild type (WT) T. monococcum and M 2 plant 
DNA. This was done to identify the homo/heterozygos- 
ity of the mutant individual. The mutant individual was 
then sequenced on an ABI3730xl (Applied Biosystems, 
Foster City, CA) using the manufacturer s instructions. 

Zygosity was determined by following the basic TILLING 
procedure in the progeny of M 2 plants. Heterozygous M 2 
plants selected for HCT2_A, HCT2_B, COMTl_A, and 
COMTl_B segregated in 3:1 heterozygous: homozygous 
ratio in M 3 generation. Homozygous M 2 mutants gave all 
homozygous M 3 progeny for the mutation. 

Lignin estimation 

Ten seeds were grown from all the identified mutants 
and homozygous M 3 individuals were isolated to estimate 
lignin content. Four mutants each for all the primer pairs 
of genes COMT1, 4CL1, and HCT2, along with WT 
T. monococcum. The spikes of homozygous M 3 individuals 
were collected upon maturity and analyzed for lignin con- 
tent. Lignin content was estimated by an acetyl bromide 
method as described in Chawade et al. [17] with 50 mg of 



dry spikelet tissues in six replications for each sample. 
This method is sensitive for even small sample size, and is 
relatively easy. The least interference from non-lignin pro- 
ducts has been reported in this procedure, because it 
allows complete dissolution of lignin in plant tissue 
and provides precise absorbance values for total lignin 
content [17,52]. 

Calculation for the estimated population size 

Assuming 5% of the mutations in a TILLING population 
to be non-sense, a population size yielding one non- 
sense for each gene of lkb length was calculated. Based 
on the Poisson distribution, at 95% confidence of finding 
at least one non-sense mutation in any gene of 1 kb, the 
required population size should be three times the size 
of this population [Table 3]. 
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